Computing NFA Intersections in Map-Reduce
Abstract
Nondeterministic finite-state automata (NFA’s) are simple yet powerful devices that model a plethora of computationally oriented phenomena. One advantage of NFA’s is that they are closed under several operations, such as concatenation, intersection, difference, and homomorphic images. This makes NFA’s ideally suited for a modular approach, for instance in the context of protocol design and web service composition. A simple but illustrative example of an e-commerce application designed from components can be found in Chapter 2 of [5]. The salient operation here is the intersection of several finite-state automata. Problems relating to NFA’s have been widely studied in the literature. One of the main issues for the NFA intersection problem is that the size of the output NFA is the product of the sizes of all input NFA’s. There is not much hope for improvement, since testing for emptiness of the intersection of a set of languages represented by NFA’s is known to be PSPACE-complete [8]. The most commonly used algorithm for computing the intersection NFA is the Cartesian construct for product automata. If there are m input NFA’s, each having n states, the product NFA will have n^m states. It is therefore important to develop good distributed algorithms for the problem. Google introduced map-reduce as a parallel programming model [4] that can work over large clusters of commodity computers. Map-reduce provides a high-level framework for designing and implementing such parallelism. A growing number of papers deal with map-reduce algorithms for various problems, for instance related to graphs [12, 9, 3, 11] and to relational joins [2, 6, 7]. In this paper we investigate the problem of implementing the Cartesian construct in map-reduce. We follow the optimization approach of Afrati et al. [1] and analyze the replication rate required for computing the ...
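To make the Cartesian construct concrete, the following is a minimal sequential sketch in Python. It is not the map-reduce algorithm studied in the paper, only the textbook product construction that such an algorithm would distribute. The dictionary layout (`start`, `accept`, `delta`), the function names `intersect_nfas` and `accepts`, and the two example automata are all assumptions made for illustration; epsilon transitions are assumed to be absent.

```python
from itertools import product


def intersect_nfas(nfas):
    """Cartesian (product) construction, restricted to reachable states.

    Each NFA is a hypothetical dict with keys:
      'start'  : set of start states
      'accept' : set of accepting states
      'delta'  : dict mapping (state, symbol) -> set of successor states
    """
    # Only symbols used by every automaton can label a product transition.
    alphabet = set.intersection(*({sym for (_, sym) in a['delta']} for a in nfas))
    start = set(product(*(a['start'] for a in nfas)))
    delta = {}
    seen = set(start)
    frontier = list(start)
    while frontier:
        state = frontier.pop()
        for sym in alphabet:
            # Successor set of each component state on this symbol.
            succs = [a['delta'].get((q, sym), set()) for a, q in zip(nfas, state)]
            targets = set(product(*succs))  # empty if any component is stuck
            if targets:
                delta[(state, sym)] = targets
            for t in targets:
                if t not in seen:
                    seen.add(t)
                    frontier.append(t)
    # A product state accepts only if every component state accepts.
    accept = {s for s in seen if all(q in a['accept'] for a, q in zip(nfas, s))}
    return {'states': seen, 'start': start, 'accept': accept, 'delta': delta}


def accepts(nfa, word):
    """Simulate an NFA (same dict layout) on a word given as a string."""
    current = set(nfa['start'])
    for sym in word:
        current = set().union(*(nfa['delta'].get((q, sym), set()) for q in current))
    return bool(current & nfa['accept'])


# Example automata over {a, b} (illustrative only).
even_a = {  # words with an even number of a's
    'start': {0}, 'accept': {0},
    'delta': {(0, 'a'): {1}, (1, 'a'): {0}, (0, 'b'): {0}, (1, 'b'): {1}},
}
ends_b = {  # words ending in b
    'start': {'p'}, 'accept': {'q'},
    'delta': {('p', 'a'): {'p'}, ('p', 'b'): {'p', 'q'}},
}

both = intersect_nfas([even_a, ends_b])
print(len(both['states']), accepts(both, 'aab'), accepts(both, 'ab'))
```

Exploring only reachable tuples keeps the sketch small, but in the worst case the construction still produces all n^m product states, which is the blow-up that motivates a distributed approach.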